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BLAST 



Basic Local Alignment Search Tool 



• Your search parameters were adjusted to search for a short input sequence, 
edit ana Resubmit S&ve Sasten .Strategies rotmMtina QdMq^ 
Protein Sequence (8 letters) 



residues 70-77 of seq id 12 



Results for: |lcl|73700 None(8aa) H 
Your BLAST job specified more than one input sequence. This box lets you choose which input sequence 
to show BLAST results for. 



Query ID 

Id 73700 
Description 

None 
Molecule type 

amino acid 
Query Length 

8 



Database Name 

nr 

Description 

All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental 
samples from WGS projects 
Program 

BLASTP 2.2.22+ Citation 



BejsreQce 

Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb 
Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein 
database search programs", Nucleic Acids Res. 25:3389-3402. 
Other reports < • * >j iMuiht : N N new 



Search Parameters 

Program blastp 

Word size 2 

Expect value 200000 

Hitlistsize 100 

Gapcosts 9,1 

Matrix PAM30 

Threshold 1 1 

Filter string F 

Genetic Code 1 

Window Size 40 



Database 

Posted date Jan 3, 201 0 5:44 PM 

Number of letters 3,505,793,397 
Number of sequences 1 0,274,250 
Entrez query none 
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Karlin-Altschul statistics 

Params Ungapped Gapped 

Lambda 0.36992 0.294 

K 0.287801 0.11 

H 1.64552 0.61 



Results Statistics 

Length adjustment 0 
Effective length of query 8 
Effective length of database 3505793397 
Effective search space 280463471 76 
Effective search space used 280463471 76 



Show Conserved Domains 

No putative conserved domains have been detected 



Distribution of 122 Blast Hits on the Query Sequence 



[21 



An overview of the database sequences aligned to the query sequence is shown. The score of each 
alignment is indicated by one of five different colors, which divides the range of scores into five groups. 
Multiple alignments on the same database sequence are connected by a striped line. Mousing over a hit 
sequence causes the definition and score to be shown in the window at the top, clicking on a hit 
sequence takes the user to the associated alignments. New: This graphic is an overview of database 
sequences aligned to the query sequence. Alignments are color-coded by score, within one of five score 
ranges. Multiple alignments on the same database sequence are connected by a dashed line. Mousing 
over an alignment shows the alignment definition and score in the box at the top. Clicking an alignment 
displays the alignment detail. 



Color key for alignment scores 
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Sequences producing significant alignments: 



(Bits) Value 



thioredoxin reductase [uncultured archaeon] 
putative NADPH-dependent FMN reductase [Halomo. 
.1! amino acid permease [Corynebacterium effic. 
hypothetical protein NECHADRAFT_70309 [Nectria. 
,}.'■ NCS2 family nucleobase : cation symporter-2 . 
hypothetical protein Pmar_PMAR006502 [Perkinsu. 
hypothetical protein Pmar_PMAR016316 [Perkinsu. 
8.2 J PREDICTED: hypothetical protein [Homo sap. 
3 . i glycosyl transferase, family 2 [Xylanimon. 
3.1! predicted protein [Micromonas sp. RCC299]. 
: . ; conserved hypothetical protein [Ricinus c. 

NAD dependent epimerase/dehydratase, puta. 
6.1 i hypothetical protein BRAFLDRAFT_7 5502 [Br. 
.5.2! GA10819 [Drosophila pseudoobscura pseudoo. 
2.1! GK20941 [Drosophila willistoni] >gb|EDW72. 
6.1; GJ12254 [Drosophila virilis] >gb I EDW69208 . 
& . i. ! GL20243 [Drosophila persimilis] >gb|EDW27. 
vacuolar targeting protein Atg24, putative [As. 
zinc finger protein [Culex quinquef asciat . 
i . 1 ! predicted protein [ Physcomitrella patens . 
sprouty 2 [Pantherophis guttatus] 
.1! putative glutamate-ammonia-ligase adenylyl . 
2.1; PREDICTED: hypothetical protein [Mus muse. 

PREDICTED: hypothetical protein [Mus musculus 
0.1 i predicted protein [Magnaporthe grisea 70-. 
1.1 ! hypothetical protein [Trichomonas vaginal. 

hypothetical protein AAur_0278 [ Arthrobacter . 
8.1! vacuolar targeting protein Atg24, putativ. 
! NADH dehydrogenase subunit 5 [Apus apus] >em. 

hypothetical protein APE_2029.1 [Aeropyrum p. 
8 . 1 hypothetical protein SNOG_06291 [Phaeosph. 
1.1} PREDICTED: hypothetical protein [Rattus n. 
hypothetical protein SAV_3761 [ Streptomyces . 
surface antigen [Hepatitis B virus] 

negative regulator of flagellin synthesis Fl. 
1 . 1 adenosine A2a receptor a [Danio rerio] >g. 
ACR265Cp [Ashbya gossypii ATCC 10895] >gb|AA. 
phage-related putative exported protein [Bor. 
putative amino-acid permease [Corynebacter iu . 
vacuolar targeting protein Atg24 [Aspergillu. 
.1! conserved hypothetical protein [Streptomyc. 

hypothetical protein MGG_00517 [Magnaporthe . 
. .1 | AMP-dependent synthetase and ligase [Caulo. 

putative secreted phytase [Streptomyces scabi . 
.;: long-chain-f atty-acid--CoA ligase [Mycobac. 
.:: acyl-CoA synthetase [Mycobacterium intrace. 
.:: acyl-CoA synthetase [Mycobacterium kansasi. 
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■■; 0 : 7,7 ...3400 'ih 2 ■ .'! : hypothetical protein MCAG_00186 [Micromono . . . 2 . 3 5541 

gb ; r.: •';•;■) 4 8 '7 .. .:. I predicted protein [Ajellomyces capsulatus G186AR] 2.2 .3 5541 

x •;• 0 \ >:■■__ 000 0 3 6503 . acyl-CoA synthetase [Caulobacter crescent... 32 . .7 5541 

; 07 0 '■ oOOo. ! two-component system sensor kinase [Strept... 2 ,? . 3 5541 

ir ;: : :P ... .00 i 7 0:; 66 1 , i putative fatty-acid — CoA ligase FadD [Myc... 22. 3 5541 

i-::'::::: 7 ''600 . ! FadD28 protein [Mycobacterium chelonae] 3 .:. .■> 5541 

?: o7 : 7;-... 30 7000 . 1 acyl-CoA synthase [Mycobacterium smegmatis s... 00.3 5541 

?: ■■; 7 : 77' ...0' : -'0 -0 1 . acyl-CoA synthetase [Mycobacterium avium 104... .3.3 5541 

■: 7 : 1773. .3 77 '73.: . .'; : acyl-CoA synthetase [Mycobacterium avium sub... 33.3 5541 

3 : 77:3.. 7 7 0031 . ; acyl-CoA synthetase [Caulobacter crescentus ... 33.3 5541 

:i:7 : ■-77.. 00 .1 .7 7 0 30 7 . DNA polymerase III subunits gamma and tau. . . 22. s 5541 

00 : 7.77; 00 '7 0 3 . : large S protein [Hepatitis B virus] 0;.0 7434 

go : 7.7 "OvoOOO . : : S protein [Hepatitis B virus] 7 7.3 7434 

gb : .7.7.05554 3 . 7 ; profilaggrin [Homo sapiens] 3 7.3 7434 

■3.0- oO.AAOo. o ,37 , 7 profilaggrin [Homo sapiens] 0 1 . 0 7434 

30 3 j OOgOOOoOO . '; : filaggrin [Homo sapiens] >sp | P20930 . 3 I FILA_H . . . 5;. 6 7434 

30 7 : 73.. 00 7 50000 0 . 7 : integrase catalytic region [Frankia sp. E... 2 .'; . 0 7434 

7o7 73 3. 0 4007 00 0 . g. ; fibronectin type III domain containing pro... 2.1... .4 9975 

egg: 003 03 3 00 0 .11 unnamed protein product [Ostreococcus tauri] 0.0 , 4 9975 

gb : .770. 73030 ,1; FIP2 [Homo sapiens] >gb I AAG00497 . 1 1 FIP2 [Homo... 21.4. 9975 

gb:5ACi50 4'7, 3 : FIP2 [Homo sapiens] 21,4 9975 

g7. ; 770\2 7 70 j. . 1 ; 2-hydroxy-6-oxo-6-phenylhexa-2 , 4-dienoate hydr . . . 21. 4 9975 

rt;:;3P...010sOO2O.l; hypothetical protein NAP1_04965 [Erythroba . . . 21 , 4 9975 

:o^LL?.!LJ3bo ^.b-^'Ob^bbJ. FHA domain-containing protein [ Sanguibact . . . 2 • . 0 13384 

.VO^O-: .;. 'OP ..... C; . : 333-. ; 3-.':-5 3 0 : 6 , .1 .;. DNA topoisomerase I [ Actinosynnema mirum . . . 21. 0 133 8 4 

?:ef ; 33-...002900S27 . :; ; f lagellin [Desulf ovibrio magnet icus RS-1] . . . 2.1. .0 13384 

? 3000 . ; . 30 V; ... 0 4 g 3 7; -4 3 0.7 3 Large exoprotein involved in heme utilizat... 31 .0 13384 

x •;• 7 ^vr. 000 06 4050.'! ; Pc22g00530 [Penicillium chrysogenum Wisco . . . .21 , .0 13384 

ref ;NP... r 001123936, 1 j SMA11 family member (sma-9) [Caenorhabdit . . . 21.0 13384 

oe;: j ;;p... .00 i 1 2833 5 . i. ; SMA11 family member (sma-9) [Caenorhabdit... 2.7, 0 13384 

.0 .--.^ . ; . .3- o^: ,,,3-: .0 .7 ^ . .-. . . : . conserved hypothetical protein [ Aspergill . . . 23, 0 13384 

q D i >.AP 8 6 R 2 . 1. 1 zinc finger transcription factor SMA-9 [Caenor... 02, 0 13384 

gb ; 3O473000 ; ... gb zinc finger transcription factor SMA-9 [Caenor... 01 ,0 13384 

gb! 77700^:340. 0: SMA-9 class B [Caenorhabditis elegans] .21,0 13384 

- -o 0: XT-. 0 060 15. 2j PREDICTED: similar to CG9066-PA [Apis mellif... 2.1 ,0 13384 

vif g4g..0Oi0546'V9 . i j SMA11 family member (sma-9) [Caenorhabdit... 21.0 13384 

0^77. .00 7 034 00 7 , 7 : SMA11 family member (sma-9) [Caenorhabdit... 3 7.0 13384 

;:^3 ; ;;7 ...'•-;:; °37 ., ; SMA11 family member (sma-9) [Caenorhabditis ... 0 7.0 13384 

?:;?f b;7.. .00 ■ 0 ..'■: 4o34 . .; ; SMA11 family member (sma-9) [Caenorhabdit... 3 7.0 13384 

?:g0 iXP.,.001 02 4.8.00,1; SMA11 family member (sma-9) [Caenorhabdit... 2 7 . 0 13384 

rgt ;0;P_pp].024S70 • 1 J SMA11 family member (sma-9) [Caenorhabdit... 3 7.7 13384 

3o3 :577.. 00 7 054 00 7, : ; SMA11 family member (sma-9) [Caenorhabdit... 2.1.0 13384 

3<>7 ; oP.oOio.bbboy. ■ ; SMA11 family member (sma-9) [Caenorhabdit... 21, 0 13384 

r. ef ! .07V-...74 7 096 , 2 SMA11 family member (sma-9) [Caenorhabditis ... 7 13384 

;-;•■;' ;X0-...0026 7 I .7 i C. briggsae CBR-SMA-9 protein [Caenorhabd . . . 0.7.3 13384 

ro 3 ; YP...0000 78 . • ; tannase and feruloyl esterase [Ralstonia met... 3 7.7 13384 

gb ; 07- 777: b: ,00 . .; ; hypothetical protein PANDA_008701 [Ailuropoda ... 30.0 17959 

gb; 0'.;'-7700060 . 7 j conserved domain protein [Staphylococcus epide... 20. 6 17959 

;7g...062 .12040 . 3 YD repeat protein [Acidovorax avenae subsp... 20.6 17959 

00 : ;73?-.00802 . ; : hypothetical protein TcasGA2_TC012010 [Triboli... 30.0 17959 

?:e.f ! ?JP,..00i:; 61.605 . 1 j nanos-like protein [ Saccoglossus kowalevs... 30.3 17959 

r:ef ; OP...00070008 . 7 i adhesin [ Acinetobacter lwoffii SH145] >gb I . . . o0 , 0 17959 
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conserved hypothetical protein [Phytophthora 



Aiig- ^-enis Select All (Vol veered seq^nees Distance tree or rssuKs M-.y^ph ^Cy^^y new 

>emb | CBH37829 . 1 | thioredoxin reductase [uncultured archaeon] 
Length=312 

Expect = 3077 

Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 1 GSTRTST 7 

GSTRTST 
Sbjct 265 GSTRTST 271 

>gb| ACF77045 . 1 | putative NADPH-dependent FMN reductase [Halomonas sp. HAL1] 
Length=221 

Score = 23.1 bits (47), Expect = 3077 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 1 GSTRTST 7 

GSTRTST 
Sbjct 9 GSTRTST 15 

>ref | ZP_05749499 . 1 | amino acid permease [Corynebacterium efficiens YS-314] 

gb | EEW50349 . 1 | amino acid permease [Corynebacterium efficiens YS-314] 
Length=568 

Score = 23.1 bits (47), Expect = 3077 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 STRTSTG 8 

STRTSTG 
Sbjct 543 STRTSTG 549 

Score = 17.2 bits (33), Expect = 188688 

Identities = 5/6 (83%), Positives = 5/6 (83%), Gaps = 0/6 (0%) 

Query 2 STRTST 7 

S RTST 
Sbjct 539 S ART ST 544 



13-4] 
Length=483 

Score = 23.1 bits (47), Expect = 3077 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 1 GSTRTST 7 

GSTRTST 
Sbjct 278 GSTRTST 284 



gb| EER74751 . 1 | NCS2 family nucleobase : cation symporter-2 [Weissella paramesente 
ATCC 33313] 
Length=434 

Score = 23.1 bits (47), Expect = 3077 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 STRTSTG 8 

STRTSTG 
Sbjct 4 STRTSTG 10 

>gb| EER18878 . 1 | hypothetical protein Pmar_PMAR006502 [Perkinsus marinus ATCC 

50983] 

Length=50 
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Score = 23.1 bits (47), Expect = 3077 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 STRTSTG 8 

STRTSTG 
Sbjct 22 STRTSTG 28 

>gb | EER13661 . 1 | hypothetical protein Pmar_PMAR016316 [Perkinsus marinus ATCC 

50983] 

Length=548 

Score = 23.1 bits (47), Expect = 3077 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 STRTSTG 8 

STRTSTG 
Sbjct 19 STRTSTG 25 

>ref |XP_001715648 .2 | S PREDICTED: hypothetical protein [Homo sapiens] 
Length=93 

GENE ID: 100134320 LOC100134320 I hypothetical protein LOC100134320 
[Homo sapiens] 

Score = 23.1 bits (47), Expect = 3077 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 STRTSTG 8 

STRTSTG 
Sbjct 18 STRTSTG 24 

Score = 23.1 bits (47), Expect = 3077 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 STRTSTG 8 

STRTSTG 
Sbjct 28 STRTSTG 34 

>ref | YP_003327263 . 1 | glycosyl transferase, family 2 [Xylanimonas cellulosilytica 
15894] 

gb| ACZ31705 . 1 | glycosyl transferase, family 2 [Xylanimonas cellulosilytica DSM 
15894] 
Length=306 

Score = 23.1 bits (47), Expect = 3077 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 1 GSTRTST 7 

GSTRTST 
Sbjct 293 GSTRTST 299 

>ref |XP_002504543.1| B predicted protein [Micromonas sp. RCC299] 

gb I ACO65801.il predicted protein [Micromonas sp. RCC299] 
Length=559 

GENE ID: 8246104 MICPUN_50521 I hypothetical protein [Micromonas sp. RCC299] 
Score = 23.1 bits (47), Expect = 3077 

Identities = 7/8 (87%), Positives = 8/8 (100%), Gaps = 0/8 (0%) 

Query 1 GSTRTSTG 8 

G+TRTSTG 
Sbjct 83 GTTRTSTG 90 

>ref | XP_002523795 . 1 | B conserved hypothetical protein [Ricinus communis] 

gb| EEF38521 . 1 | conserved hypothetical protein [Ricinus communis] 
Length=1237 

GENE ID: 8271907 RCOM_1280790 I hypothetical protein [Ricinus communis] 
Score = 23.1 bits (47), Expect = 3077 
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Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 STRTSTG 8 

STRTSTG 
Sbjct 440 STRTSTG 446 

>ref |XP_002480603 . 1 | 13 NAD dependent epimerase/dehydratase, putative [Talaromyce 
ATCC 10500] 

gb | EED20169.il NAD dependent epimerase/dehydratase, putative [Talaromyces sti 
ATCC 10500] 
Length=356 

GENE ID: 8097825 TSTA_034080 I NAD dependent epimerase/dehydratase, putative 
[Talaromyces stipitatus ATCC 10500] 

Score = 23.1 bits (47), Expect = 3077 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 1 GSTRTST 7 

GSTRTST 

Sbjct 234 GSTRTST 240 

>ref |XP_002601066.1| S hypothetical protein BRAFLDRAFT_75 5 0 2 [Branchiostoma flor 

gb | EEN57078.il E hypothetical protein BRAFLDRAFT_755 0 2 [Branchiostoma floridae] 
Length=2283 



Score = 23.1 bits (47), Expect = 3077 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 1 GSTRTST 7 

GSTRTST 
Sbjct 2266 GSTRTST 2272 

Score = 18.0 bits (35), Expect = 104804 

Identities = 5/5 (100%), Positives = 5/5 (100%), Gaps = 0/5 (0%) 

Query 3 TRTST 7 

TRTST 

Sbjct 2204 TRTST 2208 

>ref | XP_001354725 . 2 | E GA10819 [Drosophila pseudoobscura pseudoobscura ] 

gb| EAL31780 .2 | GA10819 [Drosophila pseudoobscura pseudoobscura] 
Length=728 

GENE ID: 4815191 Dpse\GA10819 I GA10819 gene product from transcript GA10819-RA 
[Drosophila pseudoobscura pseudoobscura] (10 or fewer PubMed links) 

Score = 23.1 bits (47), Expect = 3077 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 STRTSTG 8 

STRTSTG 

Sbjct 651 STRTSTG 657 

>ref |XP_002061502.1| E3 GK20941 [Drosophila willistoni] 

gb | EDW72488.il El GK20941 [Drosophila willistoni] 
Length=444 

GENE ID: 6638486 Dwil\GK20941 I GK20941 gene product from transcript GK20941-RA 
[Drosophila willistoni] (10 or fewer PubMed links) 

Score = 23.1 bits (47), Expect = 3077 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 STRTSTG 8 

STRTSTG 
Sbjct 251 STRTSTG 257 
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